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Searching for a signal depending on unknown parameters in a noisy background with matched 
filtering techniques always requires an analysis of the data with several templates in parallel in 
order to ensure a proper match between the filter and the real waveform. The key feature of such an 
implementation is the design of the filter bank which must be small to limit the computational cost 
while keeping the detection efficiency as high as possible. This paper presents a geometrical method 
which allows one to cover the corresponding physical parameter space by a set of ellipses, each of 
them being associated to a given template. After the description of the main characteristics of the 
algorithm, the method is applied in the field of gravitational wave (GW) data analysis, for the search 
of damped sine signals. Such waveforms are expected to be produced during the de-excitation phase 
of black holes - the so-called ’ringdown’ signals - and are also encountered in some numerically 
computed supernova signals. First, the number of templates ffl computed by the method is similar 
to its analytical estimation, despite the overlaps between neighbor templates and the border effects. 
Moreover, is small enough to test for the first time the performances of the set of templates for 
different choices of the minimal match MM, the parameter used to define the maximal allowed loss 
of signal-to-noise ratio (SNR) due to the mismatch between real signals and templates. The main 
result of this analysis is that the fraction of SNR recovered is in average much higher than MM, 
which dramatically decreases the mean percentage of false dismissals. Indeed, it goes well below 
its estimated value of 1 — MM^ used as input of the algorithm. Thus, as this feature should be 
common to any tiling algorithm, it seems possible to reduce the constraint on the value of MM - 
and indeed the number of templates and the computing power - without loosing as much events 
as expected in average. This should be of great interest for the inspiralling binaries case where the 
number of templates can reach some hundreds of thousands for the whole parameter space. 

PACS numbers 04.80.Nn, 07.05.Kf 


I. INTRODUCTION 

In the next years, the first generation of large interferometric gravitational-wave detectors [1-5] should reach a 
sensitivity good enough to expect the first direct detection of GW signals. In parallel of the experimental work 
consisting in operating the detectors at their working point with background noises as small as possible, the future 
data analysis methods are being prepared for the various expected sources of GW, each of them requesting specific 
tools. Yet, the Wiener filtering is used in most of these fields due to its ’optimal’ characteristics for signals whose 
time evolutions are known. Indeed, it is not only the filter giving the highest signal-to-noise ratio (SNR) among all 
the linear ones [6] but it has also the property of having the lowest false dismissal rate for a given false alarm rate 
among all filters [7]. 

Gonversely, its main drawback is its poor robustness: as soon as the physical signal and the filter do not match 
exactly, the SNR can be dramatically reduced. Even if the searched waveform is analytically computed with high 
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precision, it always depends on a vector of parameters^ A whose values are specific to a given source and thus unknown 
- e.g. its mass, the main frequency of emission... - and whose accurate estimation is indeed a major aim of the data 
analysis. 

The set of physically possible values for the vector A is the continuous parameter space P. A given filter can only be 
’efScient’ - i.e. recovering a large fraction of the SNR - in a restricted region of this space, called the efficiency area. 
Therefore, many different templates must be used in parallel for a matched filtering procedure, in order to cover the 
whole parameter space. The lattice must be dense enough to ensure a minimal loss in SNR for any real signal whose 
parameters do not exactly match any of the available filters, while the number of templates fR must be kept small to 
limit the computing power needed for the analysis. 

Given these two requirements, it clearly appears that the choice of the set of templates is very important as it has 
major consequences on both the ultimate filtering performances and its feasibility. On the other hand, ’tiling’ P is 
most of the time very difhcult as the efficiency area E of a filter depends on A and on the minimal match MM - see 
Section II A. Thus, in the general case, uniform coverage is not possible. 

In the one-dimensional case (Dp = 1) the tiling is easy as the efficiency areas are straight segments of various 
lengths which can be aligned one after the other one - see e.g. [8] where the matched filtering with Gaussian peak 
templates (only depending on their width) is studied. As soon as Dp > 2, one has to cope with overlaps, holes and 
borders. 

The problem of computing a set of templates for matched hltering purpose has already been studied for the compact 
inspiralling binaries search. The GW signal is accurately estimated thanks to various development methods - Taylor 
[9] or Fade [10] post-Newtonian (PN) expansions - and both the estimated SNR and the event rate make such events 
good candidates for a hrst detection. If the spins are neglected and the orbit circular, the parameter space is two- 
dimensional (the two masses of the stars) and the number of templates can be very high [11] if the low mass region 
is included in P. 

Using the formalism dehned in Ref. [12], Owen and Sathyaprakash present a method [13] to cover this parameter 
space at the 2 PN order. To solve the question of placing new filters with respect to the previous ones, rectangles 
inscribed inside the efficiency areas are used instead of the real ellipses, bigger but more difficult to place accurately 


^By parameters we mean here the intrinsic ones; we assume that all the extrinsic variables such as the signal timing arrival 
can be properly maximized numerically. 
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- i.e. without holes. 


In this paper, a different tiling approach is studied for the case Dp = 2. The main idea is to construct the lattice of 
filters with an iterative method ensuring a (rather optimal^) local template placement. The location of the first ellipse 
center is arbitrary and chosen by the user. Then the procedure goes on and new ellipses increasing the coverage of 
P are added one after the other. When it stops, the tiling of the full parameter space is normally complete - P fully 
covered - but is certainly not optimal from the global point of view. As described in Section II, the overlapping in the 
final configuration can be so important that a large fraction of ellipses are in fact completely covered by other ones 
and thus useless for the detection purposes, while wasting computing time. 

Therefore, a second step is then started to clean the list of templates by selecting these ellipses and erasing the 
corresponding filters. In the end, the final set of templates - all useful - only depends on the position of the first 
filter in the parameter space from which the locations of all the other ones have been iteratively computed. Due to 
border effects which cannot be properly estimated, it is impossible to define the best initial position. So, a last trick 
to further decrease the number of templates is to merge some sets of templates corresponding to different starting 
points and to apply to the full list the reduction procedure previously mentioned. 

As our team is mostly involved in the search for GW bursts (short GW signals usually lasting a few milliseconds 
for which waveforms cannot be predicted as accurately as for coalescing binaries, emitted by e.g. supernovae or the 
merging phase of compact stars inspiralling one around the other), this algorithm was originally developed to search 
for damped sine-like signals. Indeed, such time-behavior can be seen in a large fraction of supernova explosion GW 
signals computed numerically - see e.g. the waveforms of Ref. [14] - and is also expected to occur when excited black 
holes come back to equilibrium [15,16]. The latter case will be used in the following as a benchmark of the tiling 
method performances; details are presented in Section III. 

Another interesting feature of this example is that the number of templates remains small even when the allowed 
mismatch loss of SNR 1 — MM is kept very low. Thus, it is possible to check the quality of the tiling (with respect to 
the prescription originating the tiling method) and to see how the characteristics of the set of templates evolve when 
MM changes. This experience may be very useful for the inspiralling binaries search whose tilings cannot be tested 
so easily - yet, they must show similar behaviors. 

In particular, the most significant result we obtain is that one should be able to reduce the constraint on the 


^In the sense that the local number of templates is minimized for circular and slowly varying efficiency areas. 
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template spacing while keeping small the mean false dismissal rate of events, ultimately the only important quantity 
in the search of rare signals occurring at random time. Therefore, a much less numerous set of filters would still be 
efficient enough, but at a smaller computing cost. 

II. DETAILS ON THE TILING ALGORITHM 

After the introduction of some hypothesis and useful notations, the main steps of the tiling method are presented. 

A. Tiling formalism 

1. Hypothesis and notations 

As a linear filtering consists in correlating the detector output s(t) with the corresponding filtering function ip{t), 
one defines the following scalar product to represent the hltering operation: 

where Sh is the one-sided power spectrum density and the symbol means Fourier transform. The normalization 
is chosen so that, in case of signal alone, a/(s|s) is equal to the SNR. 

If one now assumes that ip(t) is almost monochromatic signal of frequency /o, one can show that if the noise 
spectrum is nearly flat around /o, Eq. (2.1) becomes approximately equal to the following equation: 


(slv^) = -FTTT / s{t)(p{t)dt (2.2) 

'Jhuo) Jr 

As the definition of the ambiguity function involves normalized templates - see Section IIA 2 -, the frequency- 
depending factor Shifo) vanishes. Thus, representing the hltering operation by a scalar product in the time domain 
is in fact accurate also with a colored noise for signals with a narrow extension in the frequency domain. This will be 
approximately the case for the damped sine waveforms - see Ref. [17] for a detailed discussion - which will be used 
to test the tiling algorithm in the next sections. 


2. Ambiguity function 


The ambiguity function F between two templates et is dehned by: 
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r ; dX^ — {k^\ ) 


(2.3) 


The templates are normalized: {k^\k^) = 1 = { fc^+dA I ^X+dX )• ^ ^ measurement of the closeness between 

two templates. It can also considered as a way to see how well the template fcy can be used to detect the signal 
the ambiguity function is the mean fraction of the optimal SNR achieved at a given distance in parameter space. 

Following Ref. [12], if dX is small enough, the ambiguity function can be approximated by a second order power 
expansion - the first order is null as the expansion is performed around the absolute maximum of F. 


F (a ; dX) = 1 - ^ 5 m. dX^ dX'' (2.4) 

with g^i, dehning a metric on the parameter space P [18,12]. 

Finally, one defines the minimal match MM as the lower bound of the recovered SNRs, which means that the loss 
of the SNR due to the mismatch between the template and the signal must be kept below 1 — MM. Following Ref. 
[12], one can note that with this definition, a - of course pessimistic - estimator of the fraction of false dismissals £ 
is: 


£ = 1 - MM^ (2.5) 

For instance, with MM = 97%, one has £ ~ 10%. This value of MM is usually found in the literature as the 
correspondence with this particular value of £ is easy. 

This quantity MM is the only input parameter of the tiling procedure; it allows one to define precisely the efficiency 
area E(A, MM) of the template k^ by the following equation: 


ig^^dA^dA"^ < 1 - MM (2.6) 

The area of P including all the vectors which match this inequality is the inner part of an ellipsoid centered on 

A whose proper volume scales as (1 — MM)”-®*"/^. The average fraction of recovered SNR for physical signals with 
parameters belonging to E(A, MM) is at least equal to MM. 
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3. The tiling problem 


Tiling the parameter space consists of finding a set of 9T filters [k.- ] so that the union of the surfaces 

^E(Ap,MM)^ completely covers P. The aim is to achieve this task by using as few templates as possible to 

keep the computing cost manageable. To do so, one has to solve two related questions: 

• which tiling algorithm to use? 

• how to test its quality? 

The main problems encountered by any tiling procedure are: overlapping, gaps, areas lost beyond the physical 
parameter space... The difference between the ellipsoid and the real border of the efficiency area is not really taken 
into account: in practical cases for one-step searches, MM is close to 1 and thus the ellipsoid is assumed to be a 
good approximation of the ambiguity surface. Moreover, the overlapping between close ellipses is an advantage to 
discard such a problem. In fact, the tests of the sets of templates constructed with the tiling method presented below 
show that the tilings have no significant holes, even with the choice MM = 0.85, which validates a posteriori this 
hypothesis. Clearly, the value oi MM from which the ellipsoid approximation of the efficiency area is no longer valid 
depends on the precise tiling problem considered. 

B. Covering the space parameter 

It is well-known that the optimal tiling of an infinite plane by identical disks of radius R consists in placing their 
centers on an hexagonal lattice, separated by a distance of \/3R - see Figure 1. In this way, the overlapping is 
minimized and the ratio rjopt between the sum of the disk areas and the surface effectively covered is 

^opt = ^ ^ ~ 1-21 (2.7) 

This value would increase in any real situation due to the border effect. 

This property of circular tiling is used by the algorithm on the following way. Once the location of a template kg 
in P has been chosen, one computes its efficiency area Eq- Through a simple plane transformation, Eq becomes a 
circle Cq of unit radius and the six neighbor centers are placed on the regular hexagonal lattice previously described. 
This choice is nearly optimal if the shape of the efficiency areas is slowly varying with respect to their characteristic 
dimensions. Using the inverse transformation, the six new center positions are given in P. Then, each ellipse associated 
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with a new center is tested in order to check if it covers a part of P not already covered by the previously defined 
ellipses. If the ellipse is useful, its center is added to the center list and will be used later to place other centers. For 
example, we have found that with MM = 97% only about 20% of the centers are kept. This iterative algorithm stops 
either when the full surface of the parameter space is covered or when no new ellipse can be placed any longer. 

C. Cleaning the template list 

A successfully completed coverage computed with the former procedure can be usually redundant in the sense that 
a large fraction of ellipses are completely covered by others. To save computing time, useless templates need to be 
identified and deleted. Like for the first step of the algorithm, one has to face the problem that erasing an ellipse is a 
local operation which can have global consequences: which template should be discarded first? 

To answer this question, the following procedure has been set: 

• For each ellipse E belonging to the tiling, one defines its utility T (E) which is the fraction of P it covers alone. 
Useless ellipses are thus characterized by T = 0. 

• Among those ellipses, the one with the smallest area in the parameter space is dropped. This prescription is 
consistent with the idea that it is a priori better to keep big ellipses which have a higher potential of coverage. 
Of course this general rule may not always be true but it looks reasonable. 

• Utilities are then updated for the surrounding ellipses as they become more “useful” - T(E) locally increases. 
Then, this scheme is iterated until all the remaining ellipses have non zero utilities. 

This second step of the tiling algorithm is very important: for most of the simulations in the damped sine case, the 
number of templates is reduced by a factor of about two with respect to the first list. 

D. Merging template lists and final cleaning 

Finally, the output of the whole procedure is a set of templates completely covering the parameter space and 
which depends on only one initial condition, that is the location of the first filter set at the beginning of the tiling 
generation. Once more, this choice may have global consequences on the tiling quality - for instance, shifting it 
’horizontally’/’vertically’ may cause a column/line of templates to appear or vanish - but cannot be simply optimized. 
Therefore, tilings starting at different points on the parameter space are computed in parallel; then, all the lists of 
templates are merged in a single one. Finally, the cleaning procedure is applied to this clearly redundant set of filters 
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to obtain the final lattice of templates. Thanks to this last step, its size is decreased by 10 or 15% with respect to the 
individual coverages in the damped sine case. 

In the damped sine case, five different tilings have been merged together to compute the final lattice of templates. 
There seems to be a good compromise between gain in the number of templates and computation time (more than 
80% of the ellipses of the full list are useless and so the cleaning procedure is much longer). This last merging step 
doubles the total computation time of the tiling, i.e. its duration is of the same order of magnitude as the sum of the 
computation times for the initial tilings. 


E. Estimation of the tiling quality 

Three variables can be used to control the quality of the tiling. 

• The number of templates ffi. 

• The ratio ?7tot between the sum of all the ellipse areas and the parameter space surface. 

• The ratio ? 7 in between the sum of all the ellipse areas inside P and the parameter space surface. 

One has clearly ? 7 in < These two estimators allow to measure the overlapping between templates and the 

fraction of extra area outside the parameter space, while 91 can be compared with its estimation computed by 
integrating the proper volume of the parameter space. Due to the imperfections of the real tiling (overlapping, border 
effects...) and to the fact that the analytical computation of the template numbers is only approximative by principle, 
it is interesting to check the consistency of the two numbers. 

III. MATCHED FILTERING DETECTION OF DAMPED SINUSOIDAL SIGNALS 
A. Normal modes of black hole oscillations 

An excited black hole, born e.g. after a supernova collapse or the merging of two compact objects, comes back to a 
stationary state by emitting GW. This emission can be described as a superposition of black hole quasi-normal modes 
[15,16]. The dominant mode is expected to be quadrupolar with the longest damping time [19]. In order to limit the 
number of free parameters defining the matched filtering parameter space, one assumes that after some transitory 
phase the waveform becomes (with a proper choice of time origin): 



h{t) (X exp (-) siii(27r/i) 


( 3 . 1 ) 


It was observed [20] that a given couple (/, t) is connected to one single set of physical parameters (Mbh, obh), the 
mass and the reduced angular momentum of the black hole. Moreover, the corresponding relation can be expressed 
analytically with a 10% precision [19,21]. Introducing the quality factor Q = nfr, one has in geometrical units 
(G = c=l): 


Q « 2 (1 — ubh) 
1 


/ 


1 — 0.63 (1 — obh) 


3/10 


(3.2) 

(3.3) 


2 TT L 

The normal mode frequency is, as expected, a decreasing function of the black hole mass, increasing with the 
rotation parameter as the quality factor, which is also independent of Mbh- Therefore, detecting such GW signal 
would give direct physical information on its source. 


B. Signal to noise ratio 


Following Ref. [17], two different methods estimating the optimal SNR give identical results: 

• a calculation in the direct space assuming that the noise is white at the oscillation frequency; 

• computing the Fourier spectrum and approximating it by a Dirac at the oscillation frequency. 
One has finally 


Mbh I Q 

Pmax OC y / X Shif) 

where Sh is the one-sided power spectrum density of the noise. The proportionality constant depends on the 
process at the origin of the black hole formation. For high mass inspiralling binaries, it can be very large 
leads to detections at cosmological distances, while for supernova collapses, it can be high enough to be detected in 
the Local Group [22,23]. 

Using this framework, it is sufficient to study the two quadratures and to compute the corresponding sets of 
templates covering the two dimensional parameter space P = {Q, f}- 


(3.4) 

physical 
[17] and 
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C. Tiling the two dimensional parameter space 


As in the black hole oscillation case, no correlation is assumed between the oscillation frequency / and the quality 
factor Q. Therefore, P is rectangular: 


= [Q min ^ ^max] ^ [fmin 5 J^max] 


Its borders are chosen in the following way: 


• For Q, by using the black hole normal mode range 


Qmin = Q(aBH = 0) = 2 

Qmax = Q(aBH = 0.99) « 16 


• For f, by using the interferometric detector main characteristics 


J /min = 20 Hz (roughly twice the realistic value for the Virgo seismic wall) 

I /max = 10 kHz (the Nyquist frequency corresponding to the Virgo sampling frequency of /samp = 20 kHz) 
The mismatch between two templates of parameters {Qo, fo) and {Q = Qo + SQ, / = /o + Sf) can be written^ 

« (Qo,/o) + 2/3 iQoJo) + 7 iQoJo) < 1 - MM (3.5) 

with 1 — MM being the maximal expected loss in SNR. The dimensionless coefficients a, f3 and 7 only depend on the 
quality factor Q. Their expressions are the following: 


• for pure damped cosines: 


a{Q) 

m) 

i{Q) 


1 64 gs + 128Q6 + 28g‘‘+ 1 

8 (l + 4Q2)2(i + 2g2)2 

1 8^"^ + 2g^ +1 

"8 (l + 2g2)(i + 4Q2) 

1 I6g4 + 6g2 + i 

8 1 + 2g2 


^From the usual definition of the metric on the parameter space - see Eq. (2.4) - there is a factor 2 between the ellipse 
coefficients and the metric ones: qqq = 2alQ^, gqf = 2(5/Qf and gff — 27 //^. 
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• for pure damped sines: 


a{Q) 

PiQ) 

liQ) 


1 + 3 

8 (1 + 4 Q 2)2 
1 3 + 4g2 
8 i + 4g2 

8 g^ + 3 
8 


In the former case, assuming that the quality factor is much greater than 1, the first order expansion of these 
coefficients in power of Q gives the results computed in Ref. [24]. The hypothesis g ^ 1 is not valid in the total range 
[gmin = 2 ; gmax = 16] we have considered in this paper and the difference between the real coefficients and their first 
order approximations can reach 10 % in the region of low quality factor. 

The number of templates needed to cover the parameter space can be roughly estimated by the usual formula: 

^ ~ ^ / ^/gdx^^dx'' = Y f i/ay - /32 (3.6) 

y Jf y Jp go Jo 

where V is the template proper volume and g is the metric determinant. As V ~ 4(1 — MM), we can express this 
number with the help of MM: 




1 


2(1-MM) 


^ ^ ^ Qo fo 


(3.7) 


From Equation (3.7), it is possible to estimate the numbers of templates 91 needed to cover the parameter space. 
One gets at the first order in Q: 


91 


V 2 


In 


8(1 -MM) V/. 


fn 


{Q max Qmin) 


(3.8) 


for the two families of signals studied. The number of templates is logarithmic in the frequency band and scales 
linearly in the quality factor range. With the numerical border values previously defined, one gets 91« 500. 

The expressions giving the coefficients a, P and 7 clearly show a dependence on the ellipse location in the parameter 
space. Using the vocabulary of differential geometry, this is a strong indication that P is curved. Following Ref. [25], 
it is possible to verify this by calculating the curvature of the search manifold. As the waveform is a function of 
the parameters {g, /o} only, we have a two dimensional problem and can thus calculate the Gaussian curvature, K, 
directly. Following the Petrov classification, the Riemann tensor associated with the metric defined in Eq. (2.4) 

has only one independent component, which we can take to be i?i 2 i 2 ) in its fully covariant form. We can thus define 
the Gaussian curvature on the search manifold by 


R1212 

9 


(3.9) 
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where g is the metric determinant. For the damped sine case for instance, the Gaussian curvature is constant over P: 


K = A. Therefore, the search manifold has the same topology as a 2-sphere. 


D. Template number 


As the template tilings appear to be very similar for the damped cosine and sine signals - compare for instance the 
expressions giving the coefficients a, (3 and 7 -, we mainly focus on the latter in the following. 


Starting point (Q, /) 

Template number before cleaning procedure 

Final template number 94 

rjin 

^tot 

(2, 20 Hz) 

1992 

854 

2.05 

2.57 

(9, 4990 Hz) 

1570 

873 

Ewa 

2.61 

(16, 10 kHz) 

1532 

888 

2.11 


(14, 100 Hz) 

1986 

849 

EBBI 

2.51 

(3, 9900 Hz) 

1491 

857 

2.15 

2.74 


Table VII: Characteristics of tilings computed from different starting points 

Table VII shows the main characteristics of some parameter space coverages computed with the tiling procedure 
presented in this paper, starting from different initial points in P. The number of templates generated by the first 
phase of the algorithm to cover a given parameter space depends greatly on the initial point chosen in P: one can see 
variations of about 20%. By comparing the number of templates before and after reduction, the importance of the 
cleaning procedure becomes clear: about 50% of filters belonging to the initial set are finally rejected. The reduction 
by a factor 2 was surprising even though it was supposed that each ellipse is placed in a nearly optimal way. In fact, 
for the damped sine case, the assumption that the shape of the efficiency areas are slowly varying with respect to 
their characteristic dimensions is far from true as shown in the zooms of Figure 2. 

Border effects play a limited - although non zero - role for the size of the tiling as Tt changes by a few percent 
(between 5 and 10% at most) with the position of the initial templates. Yet, comparing and rjtot allows one to 
see that a large fraction of the ellipse potential - i.e. not taking into account overlaps - areas is lost outside of P. 
Moreover, the values of the two ’quality’ estimators r/in and r^tot remain much higher than for the optimal disk tiling 
which shows that ellipses do not tile in the best way. 

With these different tilings, the merging procedure presented in Section IID has been applied. As expected it allows 
one to decrease 01 by about 15-20% more - see Table VIII. The reason for this gain can be understood by looking at 
the two estimators ? 7 in and rjtot- their values decreased in the final step. Therefore, the template efficiency areas have 
less overlaps and better match the border of the parameter space. 



Full list size 

91 after final cleaning 

rjm 

‘^tot 

Damped sines 

4341 

698 

1.74 

2.12 

Damped cosines 

4346 

720 

1.87 

2.26 
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Table VIII: Characteristics of the best results achieved by merging together lists of templates with different starting points. 

The last point to be mentioned here is that even if the ellipse characteristics change a lot in the parameter space, the 
algorithm succeeds in tiling the whole P. Figure 2 presents two zooms of the coverage; the first one (top) corresponds 
to a region with small quality factors Q and quite high frequencies / where ellipses are large. The second (bottom) 
focuses on the area where Q is high and / below 3 kHz; there, the ellipses are very narrow in the frequency direction, 
which is simply due to the fact that a sine-like signal of ’small /’ and high quality factor must be tracked with a very 
good accuracy: the frequency difference between the template and the signal must remain very low. Ultimately, at 
Q —>■ -foo, the correlation would be a delta-function in the frequency difference. 

E. Tests on the sets of templates 

A way to estimate the quality of a set of templates computed by the tiling algorithm is simply to check how it 
fulfills the input requirement of minimizing the loss of SNR for the detection of any signal belonging to the parameter 
space. Therefore, assuming a uniform distribution of the signal parameters in P, Monte-Carlo simulations have been 
performed in order to compute the distribution of the fraction of SNR recovered by matched hltering with the two 
banks of hlters. For each signal with random parameters, correlations are computed with all the templates and the 
higher match is selected. Ambiguity functions are hrst maximized on the time shift between the signal and the filter 
- both sampled at the Virgo sampling frequency, /samp = 20 kHz. No additional noise is needed for this optimization 
as one wants to estimate the mean loss of SNR. 

The results can be seen on Figure 3. By comparing them to the original value of the minimal match used to generate 
the sets of filters, MM = 97%, two important remarks can be made. First, the mean recovery fraction (j) is ’much’ 
higher than MM, more than 99% for both the sine bank of templates and the cosine one. Then, a very small fraction 
of signals (less than 0.05% in both cases) shows SNR losses higher than 1 — MM; yet, the recovered SNR does not 
go below 96.6%, a value very close to MM. 

So, the overlapping between templates seen both on Figure 2 and through the high values of the estimators ? 7 in and 
rytot allows a better match between filters and signals. Moreover, the tilings show almost no ’holes’ and guarantee an 
SNR loss lower than 3.4%. Therefore, they appear satisfactory for data analysis purposes. 
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F. Decreasing the minimal match MM 


If necessary, a way to decrease the number of templates would be to cover the parameter space with a less stringent 
requirement on the minimal match as the estimated number of templates scales as (1 — MM)~^ - see Eq. 3.8. Indeed, 
the tilings presented in the previous sections with MM = 0.97 show that in average the SNR loss remains much 
smaller than 3%. Therefore, the mean estimated fraction of events lost because of mismatches between the finite set 
of templates and the physical GW signals £ is well below £ = I — MM^ ~ 10%. One has for the physical signals: 


£ = 1 - <^3 


(3.10) 


where the over-line represents an average over the GW signals and the maximum of (j) is taken over all templates. 
One gets £ ~ 2.3% for the cosine filters and 2.6% for the sine filters. Then tilings with smaller values of MM are 
computed and their performances for the SNR recovery. 


MM 

97% 

96% 

95% 


92% 

90% 


91 

698 

544 

448 

383 

300 

230 

170 

£(%) 

2.6 

3.3 

4.2 

4.8 

6.2 

7.5 

10.7 


Table IX: Template numbers and fraction of events lost for tilings with different values of the minimal match parameter MM 
Table IX shows the results achieved for the case of sine filters with MM values between 85% and 97%. The evolution 


of the tiling characteristics with respect to MM can also be seen on Figure 4. For each value of the minimal match, 
a procedure similar to the MM = 97% case has been followed: computation of 5 different tilings corresponding to 
different starting points and then merging of the template lists to reduce the number of Hlters needed. 

The first thing to note is that 91 clearly decreases with MM and follows quite perfectly the expected behavior - 
see Eq. (3.7). With respect to the ’theoretical’ number of templates, 91 is slightly larger, due to border effects. Yet, 
they affect in the same ’relative’ way the tilings corresponding to different values of the minimal match and so the 
relation 91 oc (I — MM)~^ is preserved. The top graph of Figure 4 shows this relation; the line corresponds to the 
best linear Ht computed with the available data. One gets: 


91 


20 

1 - MM 


+ 43 


(3.11) 


The slope of the function is of the same order of magnitude as the estimated value given by Eq.3.8: 
^ In {^) (Qmax - Qmin) « 15.4. 

The second result - and perhaps the most important - is that even with MM = 0.90 the estimated fraction of 
lost events is smaller than the expected 10%, as shown on the bottom graph of Figure 4. With a template number 
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reduced by a factor of 3, the parameter space coverage remains well inside the initial specification: £ < 10%. It is 
only with the choice MM = 0.85 that £ slightly exceeds 10%, but with a reduction of a factor 4 in the number of 
templates. Numerically, one obtains 


£ « 1 - MM’^ with K = 0.74 (3.12) 

This feature is of small importance here as the total number of templates 01 is only a few hundred but would be 
essential for the compact inspiralling binaries searches. In this case, for a total mass of two solar masses, the number 
of templates is expected to be close to 10^ or even more depending on detector bandwidth ~ see e.g. Ref. [11]. Any 
tiling method based on the minimal match criterion like the one developed in this paper should provide a set of 
templates able to recover on average a fraction of SNR much better than MM - the Minimal Match indeed -, and 
thus miss less events than assumed during the building phase of the lattice. 

Therefore, by using a Monte-Carlo study of the generated tiling performances, it should be possible to decrease by 
a large fraction the number of templates really needed to guarantee that for instance 90% of the events potentially 
detectable will be seen. Such a refinement appears to be a way to save a large amount of computing time and would 
simplify any data analysis strategy based on matched filtering methods. 

G. Discussion 

As known since many years, mean losses in SNR recovery are well below 1 — MM, which indeed implies that the 
mean fraction of lost events is much smaller than 1 — MM^. In the previous section, the evolution of 1 — £ was 
shown to be also well-fitted by a power law MM”, with the exponent k = 0.74. It would be interesting to see if 
such dependences could be analytically foreseen, without any simulation and without any assumption on the tiling 
problem. 

Let us consider first the simpler case of a single parameter x (Dp = 1). One then assumes that the match is well 
approximated by its quadratic expansion around a given template located at a; = 0. Thus, the efficiency area E is 
bounded by the equation: 


1 — kx^ > MM 
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with MM the minimal match and k the metric coefficient. E is a segment [—Xmax! a^max] with Xmax = \/(l — MM)/k. 
Assuming a uniform distribution of physical signals in this area, it is easy to compute the mean fraction of recovered 
SNR: 


2 Xn 


dx (1 — fcx^) = 1 — 


1 - MM 


(3.13) 


With MM = 97%, one gets <j) = 1% which is very close to the value of 0.9% achieved in our simulations. It is worth 
noting that the result is independant of k (and so of the particular efficiency area considered), which validates the 
assumption of computing the mean value of the fraction of SNR recovered. 

One can estimate £ in the same way: 


£ = 1 - 


2 Xr, 


dx{l - kxy = (1 - MM) - 


3(1 - MM)2 (1 - MM) 


+ 


7 


(3.14) 


The choice of MM = 97% gives £ = 2.9%, while with MM = 85%, one has £ = 13.7%. These values exceed the 
results of the numerical simulations (respectively 2.6% and 10.7%) but are quite comparable. 

How do these results change with a two-dimensional parameter space (x, y) - which is indeed the situation considered 
in this article? By a proper choice of coordinates, one can - like for the ID-case - assume that the chosen template 
is located at (0, 0) and that the efficiency area is defined by the equation of an ellipsoid written in its simplest form: 


62 


< 1 - MM 


Straightforward calculations allow one to compute both the mean fraction of recovered SNR <j) and the mean loss of 
events £: 


1 - MM 


1 - MM^ 
4(1 - MM) 


1 -f MM + mm2 
4 


(3.15) 

(3.16) 


Like for the one-dimensional case, these expressions do not depend on the particular template and are thus estimators 
of the mean values of these quantities. With MM = 97%, one gets values higher than those computed numerically: 
<j) = 1.5% and £ = 4.4% respectively. The discrepancy between these numbers and those given in the case Dp = 1 is 
higher, even if the tiling studied here is 2-dimensional. 
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The effect of the overlap between close templates for Dp = 2 has been checked with the fully simplified model 
presented in Figure 1: efficiency areas are identical disks of a given radius. In this case, a more accurate expression 
of £ can be computed analytically by selecting for each physical signal the closest template, i.e. the one allowing 
to recover the highest fraction of SNR. But the gain is very small: the fraction of false dismissals decreases only by 
about 1-2% and remains in all cases higher than for the ID-model. In fact, one could not expect much more from 
this rehnement. Figure 1 shows that only a fraction (1 — 27r/3-\/3 « 21%) of the physical signals are affected by this 
improvement and that these particular signals have the worst matches. 

Finally, it appears that the good performances in detection efficiency of the template bank are the consequence of two 
aspects: firstly, the parameter space considered in this study is much more complex than the ideal case of a parameter 
space where one could use a uniform lattice of templates, and secondly the generated tilings remain redundant despite 
all the steps used to reduce it as much as possible. There is certainly still room for future improvements for the 
geometrical tiling algorithm. 

The comparison of the numerical data with the analytical parameterization computed for the model Dp = 2 (2D) 
is also shown on Figure 4. On this plot, it is clear that it overestimates the SNR losses with respect to the simulated 
tiling performances; yet, it gives values closer to real data than the I — MM^ curve. Thus, the fractions of false 
dismissals are also higher. 


IV. CONCLUSION 

This paper presents a method to tile any two dimensional parameter space P with a given maximal loss in SNR. 
The template list is build in two steps: the hrst one is an iterative algorithm which provides a complete coverage 
of P without any area left uncovered. As the computed set of hlters can be redundant, a second step including two 
’cleaning’ steps allows one to significantly decrease the number of templates by dropping those which are useless. This 
algorithm has been used for the lattice of templates needed to look for damped sine signals with matched hltering. 
Even if the number of filters involved is by some orders of magnitude below those computed for the inspiralling binaries 
search, this choice of waveform allowed us to strongly test the capability of the algorithm as the ellipse characteristics 
show large variations in P or, equivalently that P is curved. Moreover, the algorithm using these templates can be 
directly implemented in an on-line filtering. 

As it is not possible to choose the optimal tiling of a given parameter space, the final configuration of filters is 
computed by mixing various sets of templates computed with different initial conditions (the location of the initial 
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filter used to develop the iterative algorithm) and by applying to the global list the reduction procedure. This kind of 
’average’ decreases the number of filters by about 15% with respect to an unique procedure. The number of templates 
finally computed is comparable to the analytical estimations and Monte-Carlo simulations show that the set of filters 
fulfills the initial requirement: minimizing the loss in SNR in the whole parameter space. 

In the damped sine case, the study of the sets of templates shows that the effective loss of events £ (as a function 
of MM) is much less than its usual estimation: £ approximately behaves as 1 — MM^-'^'^ instead of 1 — MM^. With 
the usual prescription of 10% for the loss, the number of templates can be decreased by a factor close to 4. Seen 
in a reverse way, for a given MM prescription, the SNR fraction recovery is on average much higher than MM: 
99.1% for MM = 97% as an example. Even if the exponent value depends on the specific waveform details, such a 
behavior should be expected for the inspiralling binary case. This feature can allow us either to decrease the number 
of templates (and then the computing power) or perhaps obtain a more robust (with respect to noise for example) 
tiling of the parameter space. 

Finally, simple parameterizations allowing one to predict the dependence in MM of the mean fraction of recovered 
SNR (j) and of the false dismissal fraction £ are presented. They are derived independently of the particular tiling 
method studied in this paper and should thus be generic. They do not give the power-law scaling of £ inferred from 
the results of our numerical simulations and they both overestimate the losses due to the finite lattice of templates 
w.r.t. the tilings we generated. It would be interesting to compare this situation with different tilings generated with 
other algorithms. 
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FIG. 2. Zooms of the tiling in two different regions of the parameter space: one can clearly see the large variations in the 
ellipse shape in P. 
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FIG. 3. Distribution of the fraction of SNR recovered by the bank of templates for physical signals with unknown parameters, 
uniformly drawn in P {MM = 0.97). 
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FIG. 4. Top: evolution of 91 versus 1/(1 — MM) showing the expected linear behavior. Bottom: false dismissal fraction 
versus the minimal match MM. Even for MM = 90%, the lost remains below the 10% level - assumed to be acceptable and 
at the origin of the common choice of MM = 0.97. The dashed lines present two possible estimations of T: £ = 1 — MM^ and 
the parameterization computed with the model Dp = 2. Both increase much faster than £ with 1 — MM; yet, the second one 
overestimates less the real data. In the two graphs, the continuous curves show the best ht of the data. 
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